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ABSTRACT 

We have generated a series of composite QSO spectra using over 22000 individual 
low resolution (~ 8A) QSO spectra obtained from the 2dF (18.25 < bj < 20.85) and 
6dF (16 < bj < 18.25) QSO Redshift Surveys. The large size of the catalogue has 
enabled us to construct composite spectra in relatively narrow redshift (Az = 0.25) 
I and absolute magnitude (AMb = 0.5) bins. The median number of QSOs in each 

, composite spectra is ~ 200, yielding typical signal-to- noise ratios of ~ 100. For a 

' given redshift interval, the composite spectra cover a factor of over 25 in luminosity. 

I For a given luminosity, many of the major QSO emission lines (e.g. Mg II A2798, [O II] 

(N . A3727) can be observed over a redshift range of one or greater. 

Using the composite spectra we have measured the line strengths (equivalent 
^ <— I widths) of the major broad and narrow emission lines. We have also measured the 

O equivalent width of the Ca II A3933 K absorption feature due to the host galaxy of the 

AGN. Under the assumption of a fixed host galaxy spectral energy distribution (SED), 
the correlation seen between Ca II K equivalent width and source luminosity implies 
Lgai oc £qgo ±0 05 ■ We find strong anti-correlations with luminosity for the equivalent 
widths of [O II] A3727 and [Ne V] A3426. These provide hints to the general fading 
of the NLR in high luminosity sources which we attribute to the NLR dimensions 
becoming larger than the host galaxy. This could have important implications for the 
search for type 2 AGN at high rcdshifts. If average AGN host galaxies have SEDs 
$_i ■ similar to average galaxies, then the observed narrow [O II] emission could be solely 

due to the host galaxy at low luminosities (Mb ~ —20). This suggests that the [O II] 
line observed in high luminosity AGN may be emitted, in a large part, by intense star- 
forming regions. The AGN contribution to this line could be weaker than previously 
assumed. 

We measure highly significant Baldwin effects for most broad emission lines (C IV 
A1549, C III] A1909, Mg II A2798, B/y, H/3) and show that they are predominantly 
due to correlations with luminosity, not redshift. We find that the H(3 and H7 Balmer 
lines show an inverse Baldwin effect and are positively correlated with luminosity, 
unlike the broad UV lines. We postulate that this previously unknown effect is due 
to a luminosity dependent change in the the ratio of disk to non-disk continuum 
components. 

Key words: galaxies: active - quasars: general - quasars: emission lines - galaxies: 
stellar content 



1 INTRODUCTION 

The correlation of QSO emission line properties with lumi- 
* scroom@aaoepp.aao.gov.au nosity is a straightforward yet potentially highly informative 
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test of standard physical models for AGN. Since the dis- 
covery of an anti-correlation between the equivalent width 
(W\) of the C IV A1549 emission line and continuum lumi- 
nosity (L) by Baldwin (1977), a significant amount of ef- 
fort has been expended quantifying this relationship (here- 
inafter referred to as the Baldwin effect), and investigating 
similar correlations with other QSO emission lines (Bald- 
win, Wampler & Gaskell 1989; Zamorani et al. 1992; Green, 
Forster & Kuraszkiewcz 2001). The results have revealed 
that the anti-correlation with luminosity is relatively weak, 
typically W\ oc L' 3 , with = —0.2 and large scatter. Simi- 
lar correlations have been seen in most other broad emission 
lines including Mg II A2798, C III] A1909, Si IV+O IV ] A1400 
and Lya with -0.4 < < -0.1 dCrecn et al. 200 It has 
also been claimed (Green et al. 2001) that the Baldwin ef- 
fect may be dominated by an even stronger anti-correlation 
with redshift. However, in the magnitude-limited QSO sam- 
ples that have been studied to date, it is extremely difficult 
to disentangle the effects of redshift and luminosity. It it typ- 
ically only possible to access 1—1.5 magnitudes at any given 
redshift, given the steep slope of the QSO luminosity func- 
tion for magnitude-limited samples with B < 19.5 (Boyle, 
Shanks & Peterson 1988). 

A further limitation of existing studies is that it is diffi- 
cult to study the correlation with luminosity and/or redshift 
for weaker lines, in particular the narrow line region (NLR). 
The spectra used in such analyses are typically 'survey' qual- 
ity, i.e. relatively low signal-to-noise ratio (SNR ~ 5 — 10) 
and thus narrow emission lines can be difficult to detect in 
individual spectra. 

Composite QSO spectra have been ge nerated from most 



large Q SO sur veys over the past decade (Boyle 1990 



Fran- 



cis et a I. 1991), providing a detailed picture of the ensem- 
ble average spectral properties of the QSO sample. Typical 
SNRs in these spectra approach or even exceed 100, with 
even relatively weak emission lines (e.g. [Ne V] A3426) easily 
detectable. However, previous surveys have been too small 
(comprising 1000 QSOs or less) to generate composite spec- 
tra as a function of both luminosity and redshift with which 
to examine correlations. With the recent advent of much 
larger QSO surveys such as the 2dF QSO Redshift Survey 
(2QZ, Croom et al. 2001) and and Sloan Digital Sky Survey 
(SDSS, Vanden Berk et al. 2001; Schneider et al. 2002) we 
may now use composite, rather than individual spectra, to 
investigate the correlation of QSO spectral properties with 
luminosity and redshift in much greater detail that has hith- 
erto been possible. 

In this paper we describe the result of an analysis of 
composite QSO spectra based on the almost 22000 QSOs 
observed to date (January 2002) in the 2QZ. The bulk of 
these objects lie around the break in the luminosity func- 
tion, thus providing a better sampling in luminosity at any 
given redshift than QSO surveys at the bright end of the LF 
(e.g. the Large Bright Quasar Survey; Hewett, Foltz & Chaf- 
fee 1995). Moreover, we have also included a few hundred 
brighter QSOs observed with the new 6-degree Field (6dF) 
multi-object spectrographic facility on the UK Schmidt Tele- 
scope (Croom et al. in preparation) to increase the lumi- 
nosity range studied at any given redshift to typically 3-4 
magnitudes. As well as providing a wide baseline over which 
to study correlations such as the Baldwin effect, this sam- 



pling of the QSO (L, z) plane provides an opportunity to 
disentangle the effects of luminosity and redshift. 

In Section 2 we describe the data used in our analy- 
sis, while in Section 3 we discuss the methods used to gen- 
erate the composite spectra and measure the spectral line 
equivalent widths. In section 4 we present the results of our 
analysis, we then discuss these in the context of theoretical 
models in section 5. 



2 DATA 

2.1 Generation of composite spectra 

The data used in our analysis is taken from the 2dF and 
6dF QSO Redshift Surveys (2QZ, Croom et al 2001; 6QZ 
Croom et al. in preparation) . QSO candidates were selected 
for observation based on their stellar appearance and blue 
colours found from APM measurements of UK Schmidt 
Telescope (UKST) photographic plates and films in the u, 
bj and r bands. The 2QZ/6QZ area comprises 30 UKST 
fields arranged in two 75° x 5° declination strips centred on 
S = -30° and 5 = 0°. The 5 = -30° strip extends from 
a = 21 h 40 to a = 3 h 15 in the South Galactic Cap and the 
equatorial strip from a = 9^50 to a = lA h 50 in the North 
Galactic Cap. The 2QZ and 6QZ sources were selected from 
the same photometric data, the only difference being their 
ranges in apparent magnitude: 18.25 < bj < 20.85 (2QZ) 
and 16.0 < bj < 18.25 (6QZ). The combined datasets thus 
produce a uniform QSO sample over a wide range in lumi- 
nosity. Details of the candidate selection can be found in 
Smith et al. (2002). 

The 2QZ objects were observed over the period Octo- 
ber 1997 to January 2002 using the 2dF instrument at the 
Anglo- Australian Telescope. Observations were made with 
the low dispersion 300B grating, providing a dispersion of 
178.8 A mm -1 (4.3 A pixel -1 ) and a resolution of ~ 8.6 A 
over the range 3700-7900A. Typical integration times were 
55 minutes, in a range of observing conditions (1-2.5 arcsec 
seeing) resulting in median SNR ~ 5 per pixel. The brighter 
6QZ objects used in the present paper were observed in 
September 2001 using the 6dF facility at the UKST. A low 
dispersion 250B grating was used to provide a dispersion of 
286 A mm -1 (3.6 A pixel -1 ) and a resolution of ~ 11.3 A 
over the range 3900-7600A. Exposure times were typically 
100 minutes resulting in median SNR ~ 15 per pixel. 

Data from both 2dF and 6dF were reduced using th e 
pipeline data reduction system 2DFDR (Bailey et al. 2002). 



Identification of spectra and the determination of redshifts 
was carried out by a automated program, AUTOZ (Croom 
et al. 2001; Miller et al. in preparation). Each spectrum was 
checked by eye by two members of the team. In our analy- 
sis below we only include quality class 1 identifications (96 
per cent reliable identification) , these being the best quality 
spectra. We also only take the best spectrum (based on qual- 
ity class and then SNR) of each object in the case where 
there is more than one spectrum available. The combined 
2QZ/6QZ dataset provides us with 22041 independent QSO 
spectra. 

Typical redshift errors are a z = 0.003 and photomet- 
ric errors in the &j band are ~ 0.1 mag. Absolute mag- 
nitudes were computed from the observed photographic fej 
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Table 1. The number of QSOs in each of our absolute magnitude-redshift (Mb — z) bins. For each bin the central redshift and absolute 
magnitude is displayed. The last column shows the total number of QSOs in each magnitude interval over all redshifts. In some Mb — z 
intervals there are only a small number of QSOs. In these cases the spectra in adjacent Mb intervals were combined together, an * or 
t indicates where this has been done. For example, in the z = 0.375 interval, QSOs in the Mb = —24.75, —24.25 and -23.75 bins were 
combined together. 
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Table 2. List of strong spectral features removed before contin- 
uum fitting. A simple linear interpolation is made between two 
'continuum' bands defined on either side of the feature. 



Feature 



blue cont. 
band (A) 



red cont. 
band (A) 



Lya+N V 


1130- 


-1155 


1280- 


-1290 


Si IV+O IV] 


1350- 


-1360 


1445- 


-1470 


C IV+He II 


1445- 


-1470 


1685- 


-1705 


C III]+A1 III 


1800- 


-1830 


1985- 


-2020 


Mg II+Fe II 


2650- 


-2685 


3025- 


-3065 


[O II] 


3675- 


-3705 


3745- 


-3785 


[Ne III] 


3845- 


-3855 


3905- 


-3920 


H<5 


4020- 


-4050 


4165- 


-4200 


H 7 


4220- 


-4270 


4430- 


-4460 


Fe II 


4430- 


-4460 


4710- 


-4760 


H/3+[0 III] 


4710- 


-4760 


5080- 


-5105 


Fe II 


5080- 


-5105 


5450- 


-5500 


He I 


5740- 


-5790 


5940- 


-5980 


Ha 


6320- 


-6380 


6745- 


-6805 



magnitude, after correction for Galactic extinction ( [Schlegej 



et al. 1998), using the K-corrections found by Cristiani & 
Vio (1990). Throughout we assume a flat cosmological world 
model with fi = 0.3, A = 0.7 and H = 70 kms" 1 Mpc" 1 . 



3 METHOD 

We have generated composite QSO spectra in discrete abso- 
lute magnitude (AMb = 0.5 mag) and redshift ( Az — 0.25) 
bins. The bin widths were chosen to give good resolution in 
luminosity and redshift, whilst typically retaining over 100 
QSOs in at least 5 magnitude bins (a factor of 10 in lumi- 
nosity) at each redshift (see Table |l|) . Once QSOs identified 
as broad absorption line (BAL) QSOs had been removed, 
there remained a total of 21102 QSOs with which to gen- 
erate the composite spectra. The most important issue re- 
lating to the construction of the composites was that the 
spectra were not flux calibrated. The effects of differential 
atmospheric refraction, corrector chromatic aberration and 
fibre positioning errors makes obtaining even a relative flux 
calibration for sources extremely challenging. We therefore 
chose not to attempt flux calibration of our spectra. We did 
however correct for absorption due to the atmospheric tel- 
luric bands (the optical fibres also provide some absorption 
in these same bands) . We summed all the spectra in a single 
observation in order to obtain a mean absorption correc- 
tion which was then applied to the data. Also, pixels which 
had anomalously high variance due to residuals of night sky 
emission lines were flagged as bad and discarded from our 
analysis. 

As the spectra were not flux calibrated we decided to 
normalize each spectrum to a continuum level as a function 
of wavelength. This allows us to measure equivalent widths, 
line widths and line centres, however we lose any information 
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concerning continuum shape and absolute line strengths. 
Fitting the continuum relies on defining line-free parts of 
the spectrum. This is not always possible, particularly in re- 
gions of the spectrum dominated by weak Fe II emission. Our 
approach, therefore, was to remove all strong emission line 
features, interpolating linearly between pseudo-continuum 
bands defined on each side of the line. The strong features 
removed, and the continuum bands defined are listed in Ta- 
ble ^. After removing these strong lines, a 4th order poly- 
nomial was fit to each spectrum, which was then used to di- 
vide the spectra, providing an approximate continuum nor- 
malization. In a second step to remove residual large-scale 
features in the spectrum, each spectrum was divided by a 
median filtered version using a wide box-car filter of width 
201 pixels (each spectrum containing 1024 pixels or 1032 
pixels for 2dF and 6dF data respectively). At the edges of 
the spectrum the filter was reduced in size to a minimum 
half-width of 5 pixels. 

The above processing was all carried out in the ob- 
served frame. After continuum normalization the spectra 
were shifted to the rest frame, interpolating linearly onto 
a uniform scale of 1 A pixel -1 . Finally the composite spec- 
tra were produced by taking the median value of each pixel. 
For each pixel the median z and Mb of the contributing 
QSOs was also determined. We can then determine apropri- 
ate values for each feature, and not just each composite. The 
values of z and Mb assigned to each feature are the average, 
over the wavelength range of the feature, of the pixel me- 
dian z and Mb values. We derived errors for each composite 
by looking at the distribution of values to be medianed for 
each pixel. The la errors were taken to be the 68% semi- 
interquartile range of the pixel values divided by the square 
root of the number of objects contributing. We have con- 
structed composites in z — Mb and also composites binned 
in absolute magnitude only. One final composite was made 
from all the spectra (see Fig. |l|). The resulting composites 
are normalized to the pseudo-continuum over most parts of 
the spectrum, except for the 2000-3500A region where our 
procedure treats the Fe II emission bands as though they 
were continuum. Therefore, in our subsequent analysis of 
these spectra we are unable to deduce any results concern- 
ing these broad Fe II features. 

The composites in Fig. [I] (and subsequent figures) are 
plotted when at least 10 individual QSOs contribute to the 
spectrum. It can be seen that as the number of QSOs is re- 
duced the SNR declines. From this plot a number of trends 
can already be seen, with the narrow [Ne V], [O II] and 
[Ne III] showing an anti-correlation of line strength with lu- 
minosity. The broad emission lines of C IV, C III] and Mg II 
also show a similar correlation, appearing to confirm previ- 
ous detections of the Baldwin effect. A further graphical rep- 
resentation of these (and other) correlations is shown in Fig. 
^| which shows the luminosity segregated composites divided 
by the mean composite. This confirms that anti-correlations 
with luminosity are seen for a wide variety of emission lines. 
An anti-correlation is also seen between the strength of the 
Ca II H and K absorption lines and QSO luminosity, con- 
sistent with a picture where the host galaxy luminosity of 
QSOs is only weakly correlated with QSO luminosity. Fi- 
nally we note that the Balmer series (in particular H/3 and 
H7) appears to show a positive correlation with luminosity, 



in contrast to the other emission lines. We will analyse these 
apparent correlations in a quantatative manner below. 

In Fig. |^ we show examples of the composites divided 
into absolute magnitude and redshift bins. These allow us 
to decouple the effects of redshift and luminosity. Figs. ^1 
and b show composite spectra with a fixed luminosity over 
a range of redshifts. There is no obvious evidence for emis- 
sion features varying with redshift in these plots. Figs, ^c 
and d show composites in a fixed redshift interval over a 
range in luminosity. In this case we do see an apparent cor- 
relation between luminosity and some lines (C IV, [Ne V], 
[O II], [Ne III], Ca II K), with the lines is question becom- 
ing weaker with increasing luminosity. To investigate the 
nature of these correlations, in particular whether they are 
primarily a function of luminosity or redshift, we will carry 
out detailed fitting of the spectral features, followed by a 
correlation analysis. 

3.1 Line Fitting Procedure 

The composite spectra exhibit a number of spectral features, 
both in emission and absorption, which their high SNR 
allow us to fit. Twelve of these features, including 3 nar- 
row (forbidden) lines, 7 broad (permitted) emission lines, 
one semi-forbidden line (C III]) and one absorption feature 
(Ca II K), were selected for detailed study. These features 
were chosen because they exhibit large equivalent widths 
(e.g. Lya, C IV) and, in the case of the narrow emission 
lines, are relatively free from contamination by other emis- 
sion lines. 

The local pseudo-continuum on either side of each spec- 
tral feature was fitted with a straight line, using a linear 
least-squares method, and subtracted from the spectrum. 
This continuum was by no means the 'true' continuum as 
the emission lines in QSOs often lie on top of other emis- 
sion lines, in particular broad Fe II features. It was, however, 
relatively flat and close to the feature of interest. The major- 
ity of the strong emission lines in QSO spectra are blended 
with other, weaker, emission lines, usually from different el- 
ements. Additionally, permitted emission lines such as the 
Balmer series often exhibit both a broad and a narrow com- 
ponent which are emitted from physically distinct regions. It 
is therefore necessary to model and remove the contribution 
from these different lines to obtain an accurate measurement 
of the line widths and equivalent widths. 

The overlapping lines contributing to each spectral fea- 
ture were modelled using multi-component Gaussian fits as 
listed in Table ^. We note, however, that assuming a Gaus- 
sian form for the features in our spectra may be a gross 
over-simplification, and future work will endevour to define 
non-parametric measurements of line properties as well as 
these Gaussian fits. Each component was fitted with a Gaus- 
sian of the form 

(A c -A) 2 

F(\) = ae ?i— , (1) 

where a is the peak emission, A c is the wavelength of the 
peak emission and a is the width of the line. When possible 
the number of independent parameters in the model was re- 
duced by linking some of them together. For example, since 
the [O III] AA5007,4959 and narrow H/3 emission arises from 
the same region of the QSO (the narrow line region) it is 
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Figure 1. Composite QSO spectra. Top: all spectra combined into one composite. Bottom: composites computed in absolute magnitude 
intervals (AMj = 0.5) with no redshift binning. The brightest and faintest bins have been made wider to include a sufficient number of 
spectra. All the spectra have a continuum level of one, but have been offset for clarity. 
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Figure 2. The luminosity segregated composite spectra, as shown in Fig. |l| divided by the average composite (top). From this apparent 
correlations can be seen between luminosity and line strength in a number of lines, including [O II], [Ne V], Mg II, C III] and C IV. These 
correlations are discussed in the text. Again, the mean flux ratio in each spectrum is one, but the spectra have been offset for clarity. 
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Figure 3. Examples of the QSO composites generated in absolute magnitude-redshift intervals. Top: constant luminosity examples 
(—25.5 < Mb < —25.0 and —23.5 < Mb < —23.0) over a range of redshifts. Bottom: constant redshift intervals as a function of 
luminosity for 1.5 < z < 1.75 and 0.25 < z < 0.5. 



reasonable to assume that the emitting gas will have simi- 
lar velocity shifts and dispersions. The central wavelengths 
and line widths of the [O III] A4959 and narrow H/3 emission 
were therefore tied to those of the [O III] A5007. Columns 8, 
9 and 10 in Table |^ show how features were tied together. 
For example, all the components of the broad H/3 line are 
free, while the line centres (A c ) and widths (cr) for the two 
[O III] lines and the narrow H/3 line are tied together. 

The narrow emission lines were modelled as single Gaus- 
sians and were restricted to velocity widths < 1500 kms . 
Adequate fits to the Ca II absorption feature and the broad 
Balmer emission lines (H/3, H7 and H<5) were also obtained 
using a single Gaussian, although there is some evidence 
(see Fig. ^) from these high SNR spectra that the broad 
H/3 has an asymmetric non-Gaussian profile. The broad UV 



lines, i.e. from Mg II A2798 blue-ward, display emission line 
profiles with very broad bases which cannot be adequately 
modelled by a single Gaussian. They were therefore fit- 
ted with two components; a very broad Gaussian (FWHM 
~ 10000 kms ) and a narrower component (FWHM ~ 
2000 — 4000 kms" 1 ). This narrower component is not be- 
lieved to be emission from the narrow line region since its 
velocity dispersion is much larger than that measured in the 
narrow lines (e.g. [O II] A3727 and [O III] A5007) which is 
typically ~ 800km s -1 . When two broad components were 
fitted to a broad emission line, the central wavelengths of 
the Gaussians were tied together as we found no evidence for 
a velocity shift between the components. Systematic shifts 
betwe en different components , have been seen by other au- 
thors (Brotherton et al. 1994), and potential line shifts will 
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Figure 4. Example line fits. We plot each line or complex of lines fit in our anaylsis for the case of the total composite spectrum. Shown 
is the data (solid line), the individual Gaussian components (dot-dashed lines), the sum of the Gaussian components (dashed line) and 
the residuals after subtracting the fit (solid line) . In most cases the dashed line denoting the total fit is hardly visible over the data. The 
vertical lines indicate the wavelengths of the primary component (solid line) and secondary contaminating components (dashed lines). 
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Table 3. Spectral features studied. The first column gives the principal emission line in each feature for which line equivalent widths 
were measured. Columns 2 and 3 give the regions of the spectrum used in the continuum fit. Columns 4 to 10 give the properties of 
the individual fitted components. Column 4 lists the component number, column 5 the element/elements causing the emission, column 
6 the laboratory (vacuum) wavelengths of the components and column 7 indicates whether the emission is narrow or broad. Columns 
8, 9 and 10 list the fitted parameters, showing which parameters were tied together. Notes: 1. Unresolved doublets or multiplets, mean 
wavelength quoted. 2. Although this feature is identified here as He II it is actually a blend of several lines including Fe II and OIII] and 
is therefore relatively broad. 3. The [Ne III] A3968 feature is also contaminated by the He A3970 which may be present in emission or 
absorption. 4. When both H7 and H/3 were present in the spectrum, the velocity width of these components were fixed to that measured 
for the [O III] A5007 emission line. 
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be investigated in detail by Corbett et al. (in preparation). 
The only line in which the broad components were not tied 
was Lya, because absorption to the blue side of the line re- 
sulted in an asymmetric profile and it was necessary to allow 
a velocity shift between the two components to fit the line 



prof ile. Previous studies ( Wills et al. 1993 



Brotherton et al 



1994; Brotherton et al. 1994) have highlighted the fact that 



the broad line region can be well described by two compo- 
nents, often described as the intermediate line region and 
the very broad line region. It is clear that the broad UV 
lines in our composite spectra show these two components, 
however we reserve detailed discussion of line shapes for the 
forthcoming paper, Corbett et al. In all cases the best fit to 
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the spectral feature was found using \ 2 minimization tech- 
niques. 

The broad Balmer emission line H7 proved difficult to 
de-blend as it is contaminated by emission from both [Fe II] 
A4358 and [O III] A4363 as well as narrow H7 emission. Since 
the [Fe II] and [O III] emission are within 6A of each other 
they are not resolved in the 2dF spectra and were therefore 
modelled as single narrow component centered between the 
two lines. The fit was further constrained by fixing the ve- 
locity width of the narrow H7 and the combined [Fe II] and 
[O III] lines to that obtained for the [O III] A5007 emission. 
It was not possible to de-blend the O IV] ~ A 1402 multiplet 
emission from the Si IV AA1393,1402 emission and hence the 
equivalent width calculated for Si IV also contains emission 
from O IV]. 

Once the spectral feature had been modelled, the fits 
to the contaminating line emission were subtracted, leaving 
only the line of interest. The total flux in the line was mea- 
sured by integrating the flux over a wavelength range defined 
as A c ± 1.5 x FWHM, where A c is the central wavelength and 
the FWHM is that of the broadest Gaussian component fit- 
ted to the line. The equivalent width of the emission (or 
absorption) was defined as 



Flinc 
-fconl 



A, 



(2) 



where Fjj ne is the integrated flux in the emission/absorption 
line and F cont is the continuum flux measured in a lA bin 
about the central wavelength of the fit to the line. By using 
the integrated residual flux in the spectral feature rather 
than the Gaussian fit to calculate Fn nc , we avoid introducing 
errors due to the fact that the line emission may not be 
perfectly fit by a Gaussian (e.g. H/3). There is, however, an 
uncertainty in Fii ne due to the modelling and subtraction 
of the contaminating line emission and continuum which we 
have taken into account when calculating the errors in W\. 
In general, the multi-component fits to the C III] line were 
degenerate, and so we also calculate the equivalent width 
of the total spectral feature, which is used in the analysis 
below. 



3.2 Correlation analysis 

Once the widths, equivalent widths and line centres were 
measured for all lines fitted above, we tested the data for 
correlations between the line parameters and redshift and 
luminosity. In this paper we report the results for W\. Dis- 
cussion of line widths and centres will be reported elsewhere. 
We have carried out non-parametric rank correlation analy- 
sis, deriving the Spearman rank-order correlation coefficient, 
p. We a priori select 99% to be confidence level at which we 
will claim significant correlations. Specifically we will test for 
an W\ — z correlation by correlating log W\ with log(l + z), 
as evolutionary parameters for QSOs are generally an ap- 
proximate power law in (1 + z), i n particular, QSO lumi- 
nosity evolution ( Boyle et al. 2000[ ) . In testing for W\ — Mb 
correlations we will correlate log W\ with Mb . 

A particularly important issue is to deduce whether z 



or Mb is the primary parameter with which W\ correlates. 
We approach this problem in two ways; the first is to carry 
out correlations in separate z or Mb intervals, removing any 
possible spurious correlations with the second independent 



variable. The second approach is to use partial Spearman 
rank correlation (e.g. Macklin 1982) to derive the correlation 
coefficient holding one independent variable constant: 



PAX.Y 



Pax — PxyPay 
V^-PxyXI-PayT 



(3) 



where X and Y are two independent variables (e.g. z and 
Mb) and A is the dependent variable (e.g. W\). pax, Pay 
and pxY are the Spearman correlation coefficients for the 
separate correlations between two variables. The significance 
of pax.y is given by 



4 / 1 + pAX.Y 

— Pax.y 



(4) 



which is distrib uted normally about zero with unit variance 
(Macklin 1982), where iV is the size of the sample. In using 
this partial rank correlation approach we are testing the null 
hypotheses that i) the W\ — z correlation arises entirely from 
the W\ — Mb and Mb — z correlations, and ii) the W\ — 
Mb correlation arises entirely from the W\ — z and Mb — z 
correlations. If the coefficients for the W\ — z correlation are 
larger than those for the W\ — Mb correlation, this would 
imply that W\ is primarily correlated with z. 

To determine the slope of any measured correlations 
we also carry out fits to the data using the non-linear 
Levenberg-Marquardt method. This method was used in or- 
der to fit a power law while still properly taking into account 
the errors on the W\ measurements (which is not possible 
in a standard linear least-squares approach) . As will be seen 
below, the errors in the individual W\ measurements were 
often much smaller than the scatter about the best fit line. 
This could reflect the fact that parameters other than those 
fitted are introducing extra dispersion and/or the power law 
is not a adequate fit to the data. To obtain a realistic error 
on the fitted parameters, we repeat the fitting procedure, 
rescaling the errors such that the reduced x 2 is exactly one 
in each case, noting the specific cases where the data diverge 
significantly from a power law. 



4 RESULTS 

We derive both the bivariate and partial Spearman rank 
correlation coefficients for the correlation of W\ with z and 
Mb. Our primary aims in doing this are to i) test for the 
existence of any significant correlations, and ii) determine 
if those correlations are primarily with z or Mb. This sec- 



ond p oint is important given the recent claim (Green et al 



2001) that the Baldwin effect is primarily a correlation with 
redshift. 



4.1 Bivariate and partial Spearman rank 
correlation 

We select the strongest and cleanest spectral features to test 
for correlations. In cases where there is significant contami- 
nation by other features the Gaussian fits for these have been 
subtracted off the summed flux to provide a clean estimate 
of line flux. This has not been done in a few cases, where 
the separate components cannot be clearly distinguished. In 
particular, we use the summed flux of all components in 
the Si IV+O IV] complex, and do not subtract off narrow 
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Figure 5. The correlations of log W\ with log(l + z) for each line. The circles denote the measured values of log W\, with larger circles 
indicating more luminous intervals in Mb- No significant correlations are found in individual luminosity intervals. For every feature, 
regardless of whether a significant correlation is seen or not, the best power law fit to all the data points is shown (solid line). 
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Table 4. Spearman rank correlation coefficients for correlations of log(WA) with Mb and log(l + z). For each line tested we give the 
number of points correlated, N, the Spearman rank coefficient, p and the probability of the null hypothesis, P. Full, bivariate coefficients 
and probabilities are given first for the correlations with Mb and log(l + z), then we list the partial correlation values. 



bivariate correlations partial correlations 

logW A vs. M B logWx vs. log(l + z) log W\ vs. Ms logWji vs. log(l + z) 
Line N p P p P p P p P 



Lya 


19 


-0.476 


3.956E-02 


0. 


709 


6.757E-04 


-0. 


.209 


4.121E-01 


0.621 


4.908E-03 


N V 


19 


-0.119 


6.265E-01 


-0 


.059 


8.110E-01 


-0 


.170 


5.055E-01 


-0.135 


5.975E-01 


Si IV 


30 


0.184 


3.303E-01 


-0 


.134 


4.811E-01 





.140 


4.715E-01 


-0.059 


7.642E-01 


C IV 


34 


0.816 


4.058E-09 


-0 


.281 


1.080E-01 





.813 


4.746E-10 


0.258 


1.482E-01 


C III]+A1 III 


49 


0.574 


1.650E-05 


-0 


.646 


5.436E-07 





.313 


2.987E-02 


-0.465 


7.267E-04 


Mg II 


35 


0.493 


2.598E-03 


-0 


.178 


3.058E-01 





.626 


4.236E-05 


0.471 


4.369E-03 


[Ne V] 


23 


0.899 


5.625E-09 


-0 


.720 


1.060E-04 





.803 


1.418E-06 


0.330 


1.355E-01 


[O II] 


30 


0.913 


2.153E-12 


-0 


634 


1.670E-04 





.917 


1.119E-15 


0.658 


5.756E-05 


[Ne III] 


24 


0.334 


1.112E-01 


-0 


.049 


8.196E-01 





.598 


2.052E-03 


0.528 


8.692E-03 


H<5 


22 


-0.240 


2.821E-01 


-0 


.008 


9.702E-01 


-0 


.504 


1.862E-02 


-0.457 


3.648E-02 


H7 


22 


-0.634 


1.529E-03 


0. 


298 


1.786E-01 


-0 


.756 


2.876E-05 


-0.588 


4.189E-03 


H/3 


17 


-0.711 


1.382E-03 


0. 


118 


6.507E-01 


-0 


.897 


1.504E-07 


-0.781 


1.553E-04 


[0 III] 


17 


-0.179 


4.920E-01 


-0 


.012 


9.628E-01 


-0 


.265 


3.280E-01 


-0.199 


4.675E-01 


Ca II K 


17 


0.866 


7.118E-06 


-0 


.603 


1.041E-02 





.939 


4.146E-10 


0.838 


1.226E-05 



components from H<5 or H7 (including [O III] A4363). Note, 
however that we combine together broad and intermediate 
components of the same line (in particular for all the broad 
UV lines). 

We carry out the correlations first for log(l + z) and 
Mb- The resulting correlation coefficients are listed in Ta- 
ble ^| The number of points used in the correlations ranges 
from 17 to 49 with a median of 22. Table [I] first lists the 
full bivariate correlation coefficients and probabilities for the 
log W\ correlations with Mb and log(l + z). These do not 
take into account any potential spurious correlation caused 
by the correlation of Mb and log(l + z). We find that a 
number of lines show significant (P < 0.01) correlations. 
The C IV, C III]+A1 III, Mg II, [Ne V], [O II], Ca II K, H7 
and H/3 lines all show significant correlations with M B . We 
find that less lines, only Lya, C III]+A1 III, [Ne V] and [O II], 
show correlations with log(l + z). The data (filled and open 
circles) and best fit correlations (solid lines) are shown in 
Figs. I and § 

We then derive the partial Spearman rank correlation 
coefficients of logV^A with Mb and log(l + z), which are 
listed in the last four columns of Table ^. In all but two 
cases (Lya, C III] +A1 III) the significance of the correlation 
is larger for Mb than log(l + z). The strongest partial cor- 
relations with log(l + z) are for [O II] and Ca II K which are 
also the lines showing the steepest correlations with Mb. 
This is consistent with these log(l + z) correlations being 
due to the luminosity distribution of the QSOs, within a 
given luminosity range, changing with redshift. Our partial 
correlation analysis demonstrates that the correlations seen 
are primarily with Mb rather than redshift, in disagreement 
with the previous results of Green et al. (2001). 



4.2 The correlation of W\ and redshift 

To further investigate the finding that W\ primarily cor- 
relates with Mb and not z we now use the separate lumi- 
nosity intervals of width AA/b = 0.5 to search for correla- 



tions with redshift. In a given luminosity interval there are 
up to eight Az = 0.25 redshift intervals sampled. For each 
measured emission line we test for a correlation between W\ 
and log(l + z) independently within each luminosity interval 
using Spearman rank correlation. In some luminosity inter- 
vals there may be only a small number of redshift intervals 
in which a particular line is present. This is particularly 
true for lines such as H/3 and [O III] near the edge of the 
spectrum, which are only present in 3 redshift intervals. In 
the Spearman rank correlation analysis we limit ourselves 
to examining luminosity intervals which contain at least 5 
separate measurements. This is because the significance of 
p is derived from t = p^J(N -2)/(l- p 2 ) which is approx- 
imately distributed as Student's distribution. However this 
breaks down for small N, as it predicts zero probability for 
p — ±1, whereas the true likelihood of this occurring is 
(2/AH). Only the Si IV, C IV, C III]+A1 III and Mg II lines 
have 5 or more measured equivalent widths in a given lumi- 
nosity interval, hence only these lines are sensitive to tests 
for log W\ vs. log(l + z) correlations in each luminosity in- 
terval. No significant correlations are found for any of these 
lines, supporting the above finding that the correlations are 
primarily driven by luminosity. Fig. ^ shows the distribution 
log W\ vs. log(l + z) for all the lines. The symbols (circles) 
are larger for brighter luminosity intervals, and in a number 
of cases (e.g. [O II]) we see that the correlation could poten- 
tially be due to intrinsically fainter (small circles) sources 
having larger equivalent widths. The above partial correla- 
tion analysis confirms this impression. 



4.3 The correlation of W\ and Mb 

We now correlate W\ with Mb in separate redshift inter- 
vals of Az — 0.25. Again, only intervals with 5 or more W\ 
measurements are tested for correlations, however each line 
has at least one redshift interval with 5 or more points. Fig. 
^| shows the results of this analysis. We find that all of the 
lines which show significant correlations over the entire red- 
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Figure 6. The correlations of log W\ with Mb for each line. The circles denote the measured values of log W\ , with larger circles 
indicating higher redshift intervals. In cases where an individual luminosity interval shows a significant correlation with redshift the 
circles are filled and the best fit power law correlation is plotted (dashed lines). For every feature, regardless of whether a significant 
correlation is seen or not, the best power law fit to all the data points is also shown (solid line). 
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Table 5. Best fit linear correlations with Spearman rank correlation coefficients and probabilities. First we list the best fits for all 
redshift intervals combined. Then we list correlations from those lines which show significant correlations in individual redshift intervals. 
N is the number of points used in the correlation analysis. A and B are the intercept and gradient for the best fit line, such that 
log W\ = A + BMft . 8 A and SB are the errors on the intercept and slope, p and P are the Spearman rank coefficient and probability 
respectively. Probabilities marked by an asterisk (*) have p = f (a perfect correlation). These probabilities have been corrected to 
P = (2/AT). 



Line 


N 


redshift 


A 


8A 


B 


SB 


P 


P 


Lya 


19 


n nn 
u.uu 


a nn 


0.059 


0.531 


-0.065 


0.020 


-0.476 


3.956E-02 


N V 


19 


n nn 
u.uu 


a nn 


1.124 


1.004 


-0.006 


0.039 


-0.119 


6.265E-01 


Si IV 


30 


n nn 
u.uu 


a nn 


1.127 


0.173 


0.008 


0.007 


0.184 


3.303E-01 


C IV 


34 


n nn 
u.uu 


a nn 


2.905 


0.154 


0.050 


0.006 


0.816 


4.058E-09 


C III]+A1 III 


49 


n nn 
u.uu 


a nn 


2.021 


0.097 


0.026 


0.004 


0.574 


1.650E-05 


Mg II 


35 


n nn 
u.uu 


a nn 


2.103 


0.109 


0.017 


0.005 


0.493 


2.598E-03 


[Ne V] 


23 


n nn 
u.uu 


a nn 


2.557 


0.263 


0.101 


0.012 


0.899 


5.625E-09 


[O II] 


30 


n nn 
u.uu 


a nn 


3.409 


0.222 


0.135 


0.010 


0.913 


2.153E-12 


[Ne III] 


24 


n nn 
u.uu 


a nn 

4.UU 


1.192 


0.325 


0.041 


0.015 


0.334 


1.112E-01 


H<5 


22 


n nn 
u.uu 


a nn 


0.334 


0.370 


-0.020 


0.016 


-0.240 


2.821E-01 


H- f 


22 


n nn 
u.uu 


a nn 


-0.249 


0.257 


-0.067 


0.011 


-0.634 


1.529E-03 


H/3 


17 


n nn 
u.uu 


a nn 


-0.053 


0.353 


-0.073 


0.016 


-0.711 


1.382E-03 


[O III] 


17 


n nn 
u.uu 


a nn 


0.854 


0.235 


-0.017 


0.011 


-0.179 


4.920E-01 


Ca II K 


17 


0.00 


- 4.00 


4.895 


0.488 


0.218 


0.022 


0.866 


7.118E-06 


C IV 


7 


1.75 


- 2.00 


3.260 


0.196 


0.065 


0.008 


1.000 


3.968E-04* 


C IV 


6 


2.00 


- 2.25 


3.240 


0.248 


0.063 


0.010 


0.943 


4.805E-03 


C III]+A1 III 


7 


1.25 


- 1.50 


2.556 


0.163 


0.046 


0.007 


1.000 


3.968E-04* 


C III]+A1 III 


7 


1.75 


- 2.00 


1.984 


0.174 


0.024 


0.007 


0.964 


4.541E-04 


Mg II 


8 


0.50 


- 0.75 


1.948 


0.072 


0.011 


0.003 


0.929 


8.630E-04 


Mg II 


7 


0.75 


- 1.00 


2.314 


0.104 


0.025 


0.004 


0.893 


6.807E-03 


Mg II 


7 


1.00 


- 1.25 


2.461 


0.314 


0.032 


0.013 


0.893 


6.807E-03 


[Ne V] 


8 


0.50 


- 0.75 


3.915 


0.621 


0.161 


0.027 


0.905 


2.008E-03 


[O II] 


8 


0.25 


- 0.50 


3.574 


0.510 


0.143 


0.023 


1.000 


4.960E-05* 


[O II] 


8 


0.50 


- 0.75 


4.500 


0.329 


0.183 


0.014 


0.976 


3.314E-05 


H 7 


8 


0.25 


- 0.50 


-1.006 


0.343 


-0.102 


0.015 


-0.952 


2.604E-04 


H/3 


8 


0.25 


- 0.50 


-0.178 


0.319 


-0.079 


0.014 


-0.905 


2.008E-03 


[O III] 


8 


0.25 


- 0.50 


0.487 


0.205 


-0.034 


0.009 


-0.905 


2.008E-03 


Ca II K 


6 


0.25 


- 0.50 


4.823 


0.794 


0.214 


0.037 


0.943 


4.805E-03 


Ca II K 


6 


0.50 


- 0.75 


8.139 


0.651 


0.360 


0.029 


1.000 


2.778E-03* 



shift interval also show significant correlations in at least 
one individual redshift interval. In fact, C IV, C III]+A1 III, 
Mg II, [O III] and Ca II K all show significant correlations 
with log W\ in two or more redshift intervals. 

We note the odd behaviour of the Mg II equivalent 
width in the lowest redshift interval, exhibiting a trend 
with luminosity in the opposite sense to the other redshift 
ranges. It is difficult to ascribe this to a selection effect; any 
Malmquist bias in the measurement of the equivalent width 
(occurring when a particular line is the dominant or only 
line responsible for the identification of a quasar at a partic- 
ular redshift e.g. Mg II) would likely give rise to the opposite 
effect i.e. a tendency to over-estimate the mean equivalent 
width at the faint magnitudes (lowest luminosities) in the 
sample. 

Indeed, some Malmquist bias may be present in the 
data, although we have confirmed that the results pre- 
sented here are robust against the inclusion/exclusion of the 
faintest 0.5 mag interval in absolute magnitude. Moreover, 
the fact that a range of slopes are found for the correlation 
between equivalent width and Mb (both positive and nega- 
tive) further suggests that any Malmquist bias plays a small 
role. 



In Table q we list the parameters of all the significant 
correlations, including their significance and best fit param- 
eters for the fit to log W\ = A + BMb- From Table || we can 
see that there are significant differences between gradients 
of the different lines. The strongest correlation is found in 
the Ca II K line with a gradient of 0.218±0.022. The Balmer 
lines are the only ones to show a negative correlation with 
Mb (a positive correlation with luminosity), which confirms 
the visual impression gained from Fig. ^. We discuss the 
physical significance of these correlations below. 



4.4 The correlation of W\ with Mb in luminosity 
only divided bins 

Given that the above analysis appears to suggest that the 
dominant correlation is with Mb we now derive correlations 
between log W\ with Mb for the composites sub-divided 
only on the basis of luminosity (Fig. ^]). This can poten- 
tially reduce the noise and scatter in the correlations if W\ 
is truly only correlated with Mb. The resulting correlations 
are shown in Fig. The correlation coefficients and best fit 
parameters are listed in Table ^| We see evidence of signif- 
icant correlations in many lines, with some exceptions. The 
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Table 6. Best fit linear correlations with Spearman rank correlation coefficients and probabilities for composites sub-divided by luminosity 
only. N is the number of points used in the correlation analysis. A and B are the intercept and gradient for the best fit line, such that 
log W\ = A + BMft . 5 A and SB are the errors on the intercept and slope, p and P are the Spearman rank coefficient and probability 
respectively. 



Line 


N 


A 


SA 


B 


SB 


P 


P 


Lyo 


5 


0.822 


0.665 


-0.037 


0.026 


-0.400 


5.046E-01 


N V 


5 


1.781 


1.118 


0.020 


0.043 


0.800 


1.041E-01 


Si IV 


8 


1.118 


0.181 


0.008 


0.007 


-0.143 


7.358E-01 


C IV 


8 


2.917 


0.143 


0.051 


0.006 


0.976 


3.314E-05 


C III] +A1 III 


10 


2.075 


0.080 


0.028 


0.003 


0.976 


1.468E-06 


Mg II 


12 


2.233 


0.116 


0.023 


0.005 


0.720 


8.240E-03 


[Ne V] 


10 


2.423 


0.369 


0.095 


0.016 


0.964 


7.321E-06 


[O II] 


12 


3.330 


0.205 


0.131 


0.009 


0.986 


4.117E-09 


[Ne III] 


11 


0.984 


0.377 


0.031 


0.017 


0.645 


3.196E-02 


H<5 


11 


0.546 


0.265 


-0.009 


0.012 


-0.491 


1.252E-01 


H-y 


13 


-0.203 


0.175 


-0.064 


0.008 


-0.703 


7.319E-03 


H/9 


10 


-0.066 


0.259 


-0.074 


0.012 


-0.927 


1.120E-04 


[0 III] 


10 


0.862 


0.237 


-0.017 


0.011 


-0.527 


1.173E-01 


Ca II K 


9 


4.703 


0.492 


0.208 


0.023 


0.983 


1.936E-06 



lines which do not show correlations are Lya, N V, Si IV, 
[Ne III], H<5 and [O III]. We also note that in some cases, 
most notably C IV, C III]+A1 III and Mg II, the dispersion 
about the best fit correlation is much larger than would be 
expected give the errors on individual points. This suggests 
that other parameters may cause extra dispersion in the re- 
lation, or that a simple power law fit is not actually a good 
description of the underlying physics. 



5 DISCUSSION 

We now attempt to understand the above measured corre- 
lations in the context of simple physical models for AGN 
emission and host galaxy properties. We will start by con- 
sidering the host galaxy, and then look at the narrow line 
and broad line regions in turn. 



5.1 Host galaxy properties 

The Ca II K absorption line is possibly the most simple to 
interpret, as it can only be due to the stars present in the 
host galaxy of the QSO. We clearly see that as the AGN 
luminosity increases, the strength of the Ca II K declines, 
consistent with a picture in which the host galaxy does not 
increase in luminosity as fast as does the AGN. If the host 
galaxy was constant in luminosity, the slope of the corre- 
lation between log Wx and Mb would be 0.4. Our best fit 
slope is 0.208 ± 0.023, which is therefore consistent with 
host galaxy luminosity increasing slowly. In this analysis we 
make the assumption that the average spectral properties 
of the AGN host galaxies do not change significantly with 
luminosity (or redshift). We also assume that there is no 
significant aperture effect introduced by the 2-arcsec diam- 
eter of the 2dF fibres used to obtain the spectra. Based on 
the imaging results of Schade, Boyle & Letawsky (2000), we 
expect that galaxy hosts for QSOs in this luminosity range 
(—21 > Mb > —24) will be bulge-dominated with effective 
radii ranging from 1 - 2kpc for QSOs at z ~ 0.15 and 3 - 



6kpc for QSOs with z ~ 0.6. In both cases the projected 
size of the bulges are is approximately the same size on the 
sky (~ 0.75 - 1.5-arcsec diameter) and significantly less than 
the fibre diameter. 

If we also assume that the majority of the continuum 
emission is due to the QSO, we can derive a simple relation 
for the expected correlation between Wx and Mb- We can 
set Liine oc Lg a i oc Lq SO , that is, the QSO luminosity is 
proportional to the host galaxy luminosity to some power. 
If a = then the host galaxy has a constant luminosity. 
Converting to magnitude and log W\ we then find 



log W x = 0.4(1 - a)M B + Const. 



(•») 



Taking our fitted value for the slope of the correlation we find 
that a = 0.48 ± 0.06. This then implies that QSO and host 
galaxy luminosity are correlated. This has in fact been found 
from direct imaging studies of QSO host galaxies. Schade et 



al. (2000) find L ga i 



J QSO 



for a sample of low redshift 



X-ray selected AGN, with large scatter. This is a somewhat 
shallower slope than we find, however, we are in fact deriving 
the slope for the relation L ga i oc L^ ot , not L ga i oc Lq SQ . 
We can instead construct a model which assumes an exact 
power law correlation between L ga i and Lqso- However this 
does depend on us having some knowledge of the expected 
equivalent width for Ca II K in the host galaxy, without the 
AGN component. We cannot make an accurate assessment 
of the spectral properties of the host galaxy, as we have only 
fit one feature. Instead we use a Ca II K line strength derived 
from th e mean galaxy spec trum in the 2dF Galaxy Redshift 
Survey (Baldry et al. 2002). This has an equivalent width for 
Ca II K of 7.3±0.2A. The maximum Wx found for Ca II K in 
our analysis is ~ 4 — 5A, thus at the faintest luminosities the 
the host galaxy could be contributing a significant fraction 
of the continuum. If the equivalent width of the line in the 
galaxy spectrum is W / A ga i, then 



Wx = 



gal 



10 -0.4(M Tot -M gal ) ' 



(6) 



where Mxot and M ga i are the absolute B-band magnitudes 
from the total (AGN+host) and host components. Assuming 
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-22 

M □ Tota 



Figure 8. The relationship between W\(Ca, II K) and Mb for 
composites sub-divided by luminosity only. Three models are 



shown, Af„ a i 



BMqsq (solid line), M gal 



BM To 



(dotted line) and M ga i = A + BMqsq with parameters set from 
Schade et al. (2000) (dashed line). 



a linear relation between M ga i and Mqso such that M ga i 
A + BMqsq implies that 



M ga i — A — 2.5B log(10 



-0.4M Tot 



10" 



which can be solved numerically for M ga i, and substituted 
into Eq. ^. The resulting best fit is 



Al 



gal 



-11.10 ± 0.97) + (0.417 ± 0.045)M QSO , 



(8) 



which is shown in Fig. |§| (solid line). This fit is very similar 
to the previous power law fit (dotted line), only diverging 
at faint magnitudes. Also plotted is the relation found by 
Schade et al. (2000) of M ga i = -17.25 + 0.21M QS o, this 
is clearly discrepant with our data. Reducing the intrinsic 
strength of Ca II K to ~ 4.5A does not remove this discrep- 
ancy. We note that Schade et al. have demonstrated that for 
low redshift AGN (z ~ 0.1) at about L* the host galaxies 
are in almost all respects no different to normal galaxies. 
The one difference found is a bias towards spheroidal mor- 
phologies. This could imply that the stellar populations in 
AGN host galaxies are older than in average galaxies, but 
this is by no means certain given that AGN activity could 
also be accompanied by enhanced star formation. 

It is also possible to use the measured Ca II K line 
together with a mean galaxy spectrum to determine the ex- 
pected spectral properties of other features not produced 
by the AGN, in particular, the narrow forbidden oxygen 
emission lines. In the mean 2dF Galaxy Redshift Survey 
spectrum the flux emitted in [O III] A5007 is about a factor 
of two lower that the flux absorbed in Ca II K. However, 
even in our faintest composites where the host galaxies con- 
tribute the most, the [O III] line has an W\ a factor of over 
4 greater than Ca II K. Thus, assuming that the host galaxy 
SED is similar to that of a normal galaxy implies that the 
[O III] emission is coming from the AGN, rather than the 
host galaxy. 

The situation regarding the [O II] A3727 line is differ- 
ent. In the mean galaxy spectrum the line has a relative 
flux which is similar to that in the Ca II K line, and has an 
equivalent width of ~ 10.7 ± 0.3A. Comparing the correla- 
tions of Ca II K and [O II] we find that close to the faint 
end of the distribution, at Mb = -20, the [O II] W\ is 70- 
100% of what would be predicted from the host galaxy. As 
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Figure 9. The measured slopes of the log W\ vs. Mb correlation 
for different lines as a function of lower ionization potential for 
broad permitted lines (filled circles), narrow forbidden lines (open 
triangles) and semi-forbidden C III] (open square). 



the [O II] correlation with W\ is flatter than that of Ca II K 
the simple assumption of a constant host galaxy SED would 
predicit that the increased fraction of the [O II] flux is em- 
anating from the AGN at higher luminosities (~ 65% at 
Mb = —24). However, the difference is not as large as in the 
case of the [O III] line and it is possible to speculate that 
the [O II] line is formed in powerful star-forming regions and 
the star-formation rate in the host galaxy is increasing with 
AGN luminosity. In this case, all the [O II] emission could 
be due to the host galaxy. The [O II] line would then provide 
a very useful diagnostic tool for the study of star formation 
in high redshift QSOs. A further test of this would be to 
investigate the velocity distribution of the different narrow 
lines. If, for example, the [O II] line has a significantly lower 
velocity dispersion, more consistent with typical galaxies, 
than [O III] this would be good evidence of for the [O II] 
being due mostly to the host galaxy. Here we proceed by 
assuming that a major fraction of the [O II] line emission 
originates in the NLR of the AGN. The former possibility of 
the starburst origin will be investigated further by Corbett 
et al. (in preparation). 



5.2 The narrow-line region 

The lines that are thought to be emitted within the NLR 
of the AGN are [O II], [O III], [Ne III] and [Ne V] (see how- 
ever comment about [O II] in the previous section) . Of those, 
only [O II] and [Ne V] show clear and significant correlations 
with Mb. The partial correlation anaylsis for both lines is 
consistent with the hypothesis that the correlation is solely 
due to variations in Mb, with no correlation with redshift. 
The slope of the [Ne V] correlation for the magnitude only 
divided composites is 0.095 ±0.016, consistent with the cor- 
relation found in the Mb — z composites, which has a slope of 
0.101 ± 0.012. This line has the highest ionization potential 
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(here and below we consider the lower ionization potential; 
the energy required to ionize the lower ionization ion) of any 
of the narrow lines we investigate (126.21 eV). The [O II] line 
has the lowest ionization potential and the other two have 
intermediate values: 63.45 eV for [Ne III] and 54.93 eV for 
[O III]. The [Ne III] line shows only a marginal detection of 
a correlation (97% significant) and has a significantly flatter 
slope, 0.031 ± 0.017. The [O III] line shows no evidence of 
a correlation and all the measured equivalent widths in the 
Mb composites are within a range of Alog Wa = 0.15. Plot- 
ting the measured slope as a function of ionization energy 
for these narrow lines (open triangles in Fig. []) we see no 
obvious trend. However, if much of the [O II] emission is due 
to star formation, as suggested in the earlier section, there 
may be a correlation in a sense that a steeper slope corre- 
sponds to higher ionization energy. Given the small number 
of points and the additional assumption about [O II], we 
cannot use this trend to infer the NLR physics. 

Previous analyses have tentitively detected correlations 



0.15 



bet ween narrow line strength and luminosity (Green et al 
200l|)T]H owever, these were generally in data sets with a 
large fraction of non-detections, as they used single objects 
instead of composite spectra. The suggested correlation of 
line intensity with luminosity in [O III] demonstrated in Fig. 
^| does not appear to be borne out in the correlation analy- 
sis involving equivalent width measurements. The apparent 
variation could be due to a real variation in the velocity 
width of the [O III] lines while the total flux remains ap- 
proximately constant. 

A simple interpretation of the [Ne V] and [O II] corre- 
lation involves the 'disappearing NLR' model. This idea is 
based on the fact that the NLR size scales with the source 
luminosity to some power 



i^NLR = Ro(L/Lq) 



(9) 



which is reasonable given the similar correlati on found for 
the broad line region (BLR) ( Kaspi et al. 200C ) and the ac- 
tual observed NLR size in a number of sources. Luminosity 
scaling suggests 0.5 < (5 < 0.7 where the lowest value is ob- 
tained from the similarity of the narrow line spectrum in low 
and high luminosity AGN (e.g. Netzer 1990) and the j3 = 0.7 
is the value obtained by Kaspi at al. (2000) for the BLR. For 
nearby low luminosity Seyfert Is, Ro — 500 pc. Thus, the 5 
magnitude range in Mb observed in our sample translates to 
Rnlh = 5 — 13 kpc for the highest luminosity AGN, i.e. the 
scale of the entire galaxy, and indeed, recent HST imaging 
of the NLR in several radio-loud quasars show s equation (9) 
to hold for Knlr up to ~10 kpc with /3—0.5 (Bennert et al 



200 2j). ^ t is therefore possible that the NLR gas, if was there 
in the first place, has long left the galaxy and most high 
luminosity AGN contain weak or non-existent NLRs. More- 
over, if /3 = 0.7 as suggested here, and if the NLR density is 
independent of size, we can perhaps explain the decreasing 
equivalent width of [Ne V] as due to a decreasing ionization 
parameter with luminosity. 

The [Ne III] W\ correlation with luminosity is marginal 
although with the right trend (Fig. Q). However, the above 
simplified model does not explain the different behaviour of 
the [O III] line that shows no obvious correlation. We note, 
however, that the [Ne III], [Ne V] and [O II] lines are mea- 
sured over a larger magnitude range, compared with [O III], 
because the [O III] lines are lost off the red end of the spec- 
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Figure 10. The measured slopes of the log W\ vs. Mb correla- 
tion for different lines as a function of rest wavelength for broad 
permitted lines (filled circles), narrow forbidden lines (open tri- 
angles) and semi-forbidden C III] (open square). 



trum at much lower redshifts (and hence lower luminosi- 
ties in a flux limited sample) than the other emission lines. 
Thus, the reality of the model will have to be tested when 
more [O III] line measurements of higher luminosity AGN 
are available. Needless to say, if the model suggested here is 
confirmed by future observation, it will have serious impli- 
cations for searches for luminous type 2 QSOs which simply 
may not have luminous narrow emission line regions. 



5.3 The broad line region 

Lastly we consider emission from the BLR. Most of the 
strong emission lines we analyze emanate from this region 
and we will discuss each of the lines in turn. 

The Lya line is the only one to show a clear correlation 
with redshift (top left plot in Fig. We see an increase 
in Lycv strength with redshift. However, this is likely to be 
due to a combination of increased Lya forest absorption 
and our normalization of the continuum. As we have no 
knowledge of the continuum shape of our spectra, we cannot 
extrapolate a power law slope from the red to the blue side 
of Lycv. Thus, if absorption increases, our continuum point 
on the blue side of the line will be increasingly depressed, 
resulting in an over-estimate of the line flux. The N V line, 
close to Lya will also be effected by this problem, and is 
generally dominated by the errors in subtracting the Lya 
components. Previous work (Francis & Koratkar 1995) has 



found correlations between the strength of Lya and redshift, 
with higher redshift QSOs having weaker Lya. Francis & 
Koratkar attribute this largely to an increase in Lya forest 
absorption in their flux calibrated spectra. In our composite 
spectra, which are continuum divided, this same Lya forest 
absorption causes an underestimate of the continuum level. 
This in turn produces the positive correlation of W\ with 
redshift. 
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The Si IV+O IV] lines had to be fit as one feature, 
as they could not be de-blended. Apart from this they are 
relatively clear of contamination. This blend shows no evi- 
dence whatsoever for any correlation with luminosity, with 
equivalent widths which are constant to ~ 15% over a fac- 
tor of ~ 40 in luminosity. This is in disagreement with 
Green et al. (2001) who find a significant correlation with 
W\ oc £-°' 30 ± - 08 We find a shallower slope which is con- 
sistent with zero. We note that Green et al. measure their 
correlation over a range which is only a factor of ~ 10 in 
luminosity. 

The C IV line shows a highly significant correlation 
of equivalent width with luminosity, Wx oc £-0-i28±o.oi5 
The C III] blend (C III], Al III and Si III]) correlation is 
flatter, with Wx oc L" 070 * 008 . However, this blend also 
shows apparent significant correlations with redshift which 
could be, in some part, due to the combination of lines 
taken. For Mg II, we find Wx oc £-0-058±o.oi3 with signi f_ 
icant departures away from the correlation at low luminos- 
ity, Mb > —22. Checks of the individual line fits show no 
obvious systematic problems, and it appears that these de- 
viations from the simple power law are real. If they were also 
present in the C IV or C III] lines and occurred at the same 
luminosity we would not see the effect as the faintest QSOs 
which have these lines visible are too bright (Mb = —23 
to —24). An alternative cause is a change in the SED, e.g. 
of the host galaxy, but this effect is not seen in the narrow 
[Ne V] emission line which spans a similar range in luminos- 
ity. However, we also note that the Mg II emission line lies 
on top of the broad Fe II emission feature which we have 
treated as continuum. This Fe II emission may contribute 
a significant proportion (~25%) of this continuum emission 
and hence any variation in the strength of the Fe II emission 
as a function of luminosity will affect the Mg II equivalent 
width measurements. 

Perhaps the biggest surprise of this analysis is the 
Balmer line correlation which is markedly different to that 
of the broad UV emission lines. Although the broad H5 line 
shows no significant correlation, the equivalent widths of the 
H7 and H/3 lines show a positive correlation with luminosity. 
There is a hint that the strength of the correlation increases 
for lower order Balmer lines with Wx oc £ 0160 ± 020 an d 
Wx oc L 0185±0 030 for H 7 and H/3 respectively. This inverse 
Baldwin effect has not been seen before. Note that for the 
H7 line we included the narrow component and the [O III] 
A4363 line in the equivalent width, while for H/3 we subtract 
of the narrow component. However, the positive correlations 
with luminosity remain if the narrow H7 component is sub- 
tracted from the emission line or the narrow H/3 component 
is included. 

We plot the measured slopes of equivalent width depen- 
dence on luminosity against the ionization potential (filled 
circles in Fig. ^ for all the major emission lines in this study. 
Excluding the [O II] line, which could be contaminated by 
emission from the host galaxy, we see a correlation between 
the slope of the Baldwin relation and the ionization poten- 
tial. 

A possible explanation for this effect is that the SED 
of the ionizing continuum may steepen towards lower ener- 
gies with increasing luminosity, resulting in more photons 
being available to ionize hydrogen but relatively fewer with 
energies greater than 64eV available to ionize C IV. Alter- 



natively, the ionization parameter may change as a function 
of luminosity, as discussed below. 

On the other hand, the correlations we measure between 
equivalent width and luminosity may be caused by changes 
in the continuum flux under the lines rather than the line 
flux itself. We plot the slope of the equivalent width depen- 
dence on luminosity vs. the rest wavelength of the features 
in question (Fig. hof) . In this case, there may be a trend for 
the slope of the Baldwin relation to increase with decreasing 
wavelength. However, the broad lines show a much flatter 
correlation with wavelength than the narrow lines (even ex- 
cluding [O II]); arguing against an effect caused by simple 
continuum variation. 

Our results imply that either (a) L(H/3)/L(C IV) in- 
creases with luminosity, (b) L(4860A)/L(1550A) decreases 
with luminosity or a combination of the two. We have no 
way of directly answering this question since our data are 
not flux calibrated and we cannot measure the above lu- 
minosity ratios. However, we can refer to earlier findings 
discussing the line and continuum luminosities in smaller, 
less complete samples. 

Earlier studies of AGN SEDs (e.g. Vanden Berk et al. 
2001 and references therein) show that the slope of the power 
law continuum changes dramatically near to the H7 and H/3 
lines. Blue-wards of about 4500A the continuum slope has 
a ~ —0.5, while red- wards of this point the slope is more 
like a ~ —1.6. This has been interpreted as due to the dif- 
ferent continuum p rocesses con tributing to the SED at dif- 
ferent wavelengths (Laor 199C). At short wavelengths most 
of the emission is due to accretion disks (e.g. Laor & Net- 
zer 1989, and references therein) while at longer wavelengths 
dust emission, combined with non-thermal emission in radio- 
loud sources, is more important. Having this in mind we 
can speculate that the relative contribution is luminosity 
dependent in a sense that the accretion disk contribution 
is more important at long wavelengths in higher luminosity 
sources. This is exactly the trend observed by Laor (1990) 
in his study of the continuum emission in high luminosity 
AGN. Other ideas are related to the accretion disk incli- 



nation ( |Netzer 1985fc |Netzer et al. 199$ [Wilkes et al. 1999|) 



changes in the ionizing luminosity as a function of luminosity 
(lEspcy fe Adreadis 199s| ; |Green 199g| ; |Korista et al. 1994 
Wandel 1999) (considered above) or emission by optically 



thin gas ( |5hields et al 199! 



Most of these models predict 
a similar trend for all lines albeit with a different slope. No 
existing model, except for the one involving changes in the 
relative luminosity of the two components, can explain the 
observed difference between C IV and H/3. 

Another possibility is to test the dependence of optical 
and UV line ratios, such as L(H/3)/L(Lya), on luminosity 
and continuum shape as was done for example by Netzer 
et al. (1995) for a small (20) sample of radio loud AGN. 
That study shows a clear correlation of L(H/3)/L(Lya) with 
L(486lA)/L(1216A). However the small size of the sample, 
and the small luminosity range, prevents any clear conclu- 
sion. 

Finally, we note that the H/3 luminosity range and the 
C IV luminosity range are very different, with almost no 
overlap, because of the z — Mb correlation in our sample. It 
is therefore possible that the real Baldwin relationship for all 
lines is more complicated than previously assumed, showing 
both rising and falling branches. This may be related to the 
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unusual shape of the Mb — W\ curve of the Mg II line seen 
in Fig. 0. This idea can only be studied by obtaining good 
H/3 measurements in high luminosity AGN. 



6 CONCLUSIONS 

We have used composite QSO spectra to make the most 
accurate determination of line and continuum correlations 
to date. We see the Baldwin effect in a number of lines. 
In general the equivalent width correlations are primarily 
with luminosity and not redshift. The broad UV lines gener- 
ally show strong anti-correlations with luminosity, although 
somewhat flatter than previous determinations. The Balmer 
line equivalent widths, in contrast, show an inverse Baldwin 
effect, and are positively correlated with luminosity. We pos- 
tulate that this difference could be due to a different combi- 
nation of disk and non-disk components in AGN of different 
luminosity. 

Some, but not all, narrow forbidden lines also show anti- 
correlations with luminosity. A possible explanation is that 
the NLR becomes more extended, and fainter by compar- 
ison, at high luminosity. This has important implications 
concerning the possible detection of type 2 QSOs at high 
redshifts. By comparing the strength of the Ca II K ab- 
sorption line to the [O II] emission line via a mean galaxy 
spectrum we find that at low luminosities most, if not all, of 
the [O II] flux could come from the host galaxy and not the 
AGN. This raises the possibility that a large fraction of the 
observed [O II] in high luminosity AGN is due to enhanced 
nuclear star-formation. Using the Ca II K line and assuming 
a constant SED for the host galaxy, we are able to derive the 
correlation between host galaxy and AGN luminosity which 

;, f , „ r0.417±0.045 

A number of areas have still to be investigated. First is 
the detailed shapes and positions of the lines, which will be 
discussed in a forthcoming paper (Corbett et al. in prepa- 
ration). Secondly, after looking at the mean properties as 
a function of redshift and luminosity, we should also inves- 
tigate the variance about this mean by fitting individual 
spectra. 
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